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1 Introduction 



When the alphabet of a language L is linearly ordered, we may equip L with 
the lexicographic ordering. It is known that every countable linear ordering is 
isomorphic to the lexicographic ordering of a (prefix) language. 



The finite condensation [18J of a linear ordering is obtained by collapsing any 
two points that are at finite distance of one another. By applying finite con- 
densation iteratively, a fixed point of the condensation map may be reached at 
some ordinal called the finite condensation rank (FC-rank) of the linear order- 
ing. It is known [18] that for scattered orderings, the FC-rank agrees with the 
Hausdorff rank. 

The lexicographic orderings of regular languages (i.e., the regular linear order- 
ings) were studied in [U El El IH El [HI E3 CEO EES] • These linear orderings agree 
with the leaf orderings of the regular trees, and are all automatic linear order- 
ings as defined in [16]. It follows from results in [15] that all scattered regular 
linear orderings have finite FC-rank, and in fact all regular linear orderings 
have finite FC-rank [16]. Moreover, an ordinal is the order type of a regular 
well-ordering iff it is stricly less than 
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The study of the lexicographic orderings of context-free languages (context-free 
linear orderings) was initiated in [4] and further developed in [5J EJ [T2J dSl EE3] • It 
follows from early results in [9] that the lexicographic orderings of deterministic 
context-free languages are (up to isomorphism) identical to the leaf orderings 
of the algebraic trees, cf. [5]. In jl], it was shown that every ordinal less than 
uj^ is the order type of a well-ordered deterministic context-free language, 
and it was conjectured that a well-ordering is isomorphic to the lexicographic 
ordering of a context-free language if and only if its order type is less than . 
This conjecture was confirmed in [5] for deterministic context-free languages, 
and in [TJ] for context-free linear orderings. Moreover, it was shown in [6] 
and [H] that the FC-rank of every scattered deterministic context-free linear 
ordering and in fact every scattered context-free linear ordering is less than 
u u . Since the FC-rank of a well-ordering is less than exactly when its order 
type is less than , it follows in conjunction with results proved in (3] that 
a well-ordering is isomorphic to the lexicographic ordering of a context-free 
language or deterministic context-free language if and only if its order type 
is less than u u ". Exactly the same ordinals are the order types of the tree 
automatic well-orderings, see [10] . 

In this paper we consider all context-free linear orderings not just the scattered 
ones. By the above, the FC-rank of a contrext-free linear ordering is at most 
oj". Here we prove that it is strictly less than u w . 

For a study of lexicographic orderings of languages generated by deterministic 
higher order grammars we refer to [7]. 

2 Preliminaries 

A linear ordering [18] (/, <) is a set / equipped with a strict linear order 
relation <. As usual, we will write x < y for x, y 6 / if x < y or x = y. 
A linear ordering (J, <) is finite or countable if / is. A morphism of linear 
orderings is an order preserving map. Note that every morphism is necessarily 
injective. When (J, <) and (J, <') are linear orderings such that IC J and the 
embedding I <^->- J is a morphism, we call (J, <) a subordering of (J, <'). In this 
case the relation < is the restriction of the relation <' onto / and we usually 
write just / for (/, <). In particular, an interval of (J, <') is a subordering / 
with the property that whenever x <' y <' z and x, z G /, then y £ I. An 
interval / is called closed if there exist x,y in J with I = {z : x <' z <' y}. 
We denote it by [x,y]. 

An isomorphism is a bijective morphism. Isomorphic linear orderings are said 
to have the same order type. The order types of the positive integers N, negative 
integers N_, all integers Z, and the rationals Q, ordered as usual, are denoted 
cj, u*, ( and 77, respectively. As usual, the finite order types may be identified 
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with the nonnegative integers. 

Recall that a linear ordering (/, <) is dense if it has at least two elements and 
for every x,y £ I with x < y there is some z E I with x < z < y. A quasi-dense 
linear ordering is a linear ordering that has a dense subordering, and a scattered 
linear ordering is a linear ordering that is not quasi-dense. For example, N and 
Z are scattered, Q is dense, and the ordering obtained by replacing each point 
in Q with a 2-element linear ordering is quasi-dense but not dense. Clearly, 
every subordering of a scattered linear ordering is scattered. It is well-known 
that a linear ordering is quasi-dense iff it has a subordering of order type 77. 
Moreover, up to isomorphism, there are 4 countable dense linear orderings, the 
ordering Q of the rationals possibly equipped with a least or greatest element, 
or both. 

When (J, <) is a linear ordering and for each i E /, (J,, <j) is a linear ordering, 
the ordered sum 

i&I 

is the disjoint union [j ieI (Ji x {i}) equipped with the order relation (x,i) < 
(y,j) iff either i < j, or i = j and x <j y. When each (J;, <;) is the linear 
ordering (J, <'), we call the ordered sum the product of (/, <) and (J, <'), 
denoted (/, <) x (J, <'). Finite ordered sums are also denoted as (ii,<i) + 
•■■ + (I n ,< n ). Since the ordered sum preserves isomorphism, we may also 
define ordered sums of order types. For example, 1 + rj + 1 is the ordered type 
of the rationals equipped with both a least and a greatest element. It is known 
that every scattered sum of scattered linear orderings is scattered. This means 
that if (/, <) is scattered as is each (Jj, <j), then ^2 ieI (Ji, <«) is also scattered. 
A sum over a dense linear ordering (J, <) is referred to as a dense sum. 

Suppose that (J, <) is a linear ordering. We say that x,y E I are at a finite 
distance if the intervals [x, y] and [y, x] are both finite (with the convention if 
y < x then [x,y] is empty). 

For each ordinal a, we define an equivalence relation ~^ on J, together with a 
linear ordering (//~^, <£) such that for all x, y, x/~^<^ x < y and x 

and y are not related by the relation 

The relation ~q is the identity relation. When ot — f3 -\- 1 is a successor ordinal, 
then for all x,y G I, x ~^ y iff xj ~^ and yj ~^ are at a finite distance in 
<i). Suppose now that a > is a limit ordinal. Then for all x,y E I, 
we define x y iff there exists some (3 < a with x ~^ y. 

By Hausdorff's theorem [18], there is a least ordinal a = FC(J, <), called the 
finite condensation rank (or FC-rank) of (/, <) such that (//~„, <^) is either 
dense, when (J, <) is quasi-dense, or has at most 1 element, when (J, <) is 
scattered. In particular, every quasi-dense linear ordering is a dense sum of 
scattered nonempty linear orderings. 
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The following facts are known from [TS]. The first is stated in Exercice 5.12 
on page 82, the second in Lemma 5.14 on page 83, and the third immediately 
follows from the definition of the FC-rank. 

PROPOSITION 2.1 1. //(/,<) is a linear ordering and J is an interval of 
I, then for all ordinals a, ~^ is the restriction of ~£ onto J. 

2. If (I, <) is scattered with a subordering (J, <), then FC(J, <) < FC(7, <). 

3. If (I, <) is scattered and a is an ordinal such that FC( J, <) < a for all 
closed intervals J C I , then FC(J, <) < a. 

3 Lexicographic orderings 

We will consider countable linear orderings that arise as lexicographic orderings 
of context-free languages. Suppose that A is an alphabet which is linearly 
ordered by the relation <. Then we define a strict partial order < s on A* 
by u < s v iff u = xay and v = xbz for some x,y,z G A* and a,b G A with 
a < b. We also define u < p v iff u is a proper prefix of v, and u <i v iff 
u < s v or u < p v. The lexicographic order relation <i turns A* into a linear 
ordering. In particular, any language LCi* gives rise to the linear ordering 
(L, <i) called the lexicographic ordering of L. We say that a language L C A* 
is scattered, dense, etc. if its lexicographic ordering has the corresponding 
property. Moreover, we say that a lexicographic ordering is a regular or a 
context-free linear ordering if it is isomorphic to the lexicographic ordering of 
a regular or context-free language. The FC-rank FC(L) of a language LCA* 
is the FC-rank of its lexicographic ordering. 

We give some examples. 

Example 3.1 Consider the alphabet 2 = {0, 1} ordered by < 1. Then the 
lexicographic orderings of the regular languages 1*0, 0*1, + l + l + are of or- 
der type u, uj* and (, respectively, so that each is scattered of FC-rank 1. 
The lexicographic ordering of (00 + 11)*01 is 77. It is thus dense with FC- 
rank 0. The context-free linear ordering (U n >o l™0(l*0) n , <i) is of order type 
1 + uj + u 2 + . . . = and has FC-rank to. The context-free linear orderings 
(Un>i l n 0(0(0 + l + l + 0) + 10<"), < t ) and (U„>i l n 0(0(00 + ll)*01 + l(l*0)"), < e ) 
with respective order types ( + l + ( + 2 + ... and rj + u + rj + uj 2 + ... have 
FC-rank 2 and u, respectively. 

It is known that every countable linear ordering is isomorphic to the lexi- 
cographic ordering of a prefix language^ over the 2-element alphabet 2 not 

1 Prefix languages are sometimes called prefix-free. 
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containing the empty word e. Similarly, every context-free linear ordering is 
isomorphic to the lexicographic ordering of a context-free prefix language over 
2 not containing e. For this, consider an arbitrary context-free language L 
over an alphabet A. Take ± to be a fresh symbol assumed to be smaller than 
any symbol of A. The lexicographic ordering of the prefix context-free lan- 
guage LA. is isomorphic to (L, <q). To come back to the binary alphabet 2, we 
then use a standard encoding preserving the order on A. Thus, below we may 
restrict ourselves to such context-free languages which can all be generated by 
context-free grammars G = (N, 2, P, S) which do not contain useless nonter- 
minals and such that the right-hand side of each production is in 2 + N* and 
does not contain S. We say that such grammars are in weak Greibach normal 
form (weak GNF). The height of such a grammar is the largest integer n such 
that there is a sequence of nonterminals 

Xq , . . . , X n 

such that for each i < n, X i+1 is accessible by a derivation from JQ, but Xi is 
not accessible from X i+ i. 

Lemma 3.2 Suppose that L C 2* is a context-free prefix language generated 
by a grammar G in weak GNF of height n. Let u,v G L, and suppose that 
L[ u ,v] = {w E L : u < s w < s v} is scattered. Then the FC-rank L[ UjV ] is at most 
co n + l. 

Proof. It suffices to prove this claim when L\ UjV ] is infinite (and thus u < s v). 

Let G = (N, 2, P, S) be a grammar in weak GNF. Let S' be a new nonterminal, 
and consider all left derivations S =^>* w'p' =>- wp, where w, w' G 2* with 
u < s w < s v and w' is either a prefix of u or a prefix of v, moreover, p,p' G N*. 
There can only be finitely many such derivations as G is in weak GNF. For 
each of these we add a new production S' — > wp to P' . In addition we add the 
two productions S' — > u and S' — > v to obtain a new set of production P'. The 
resulting grammar G' = (N,2,P',S') generates the scattered language !/[„,„]. 
For the inclusion L(G') C L[ Uj „], note that a word generated by G' which is 
not equal to u or v is of the form wz with u < s w < s v. As by construction 
L(G') C L(G), we have L(G') C Lr UjU i. In order to prove the opposite inclusion, 
as L(G) is prefix and u and v belong to L(G'), we can restrict our attention to 
words x G L[ UiV ] such that u < s x < s v. Consider a left-most derivation of x 
over G, 

S Wtfi =>•••=>• W n p n 

with Wi G 2* and pi G X* for 1 < % < n, and w n = x and p n = e. Let £ be 
the least index such that w^ is neither a prefix of u nor a prefix of v. Clearly 
u < s W£ < s v and hence by the definition of G', S' — > w^Pe is a production of 
P' . It follows that G' generates x and hence that L[ UiV ] C L(G'). 
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Since the height of (the "reduced part" of) G' is at most n, the FC-rank of 
L[ UjV \ = L{G') is at most u n + 1, by the main result of [2]. □ 

Corollary 3.3 Suppose that L C 2* is a context-free prefix language gener- 
ated by a grammar G in weak GNF of height n. If Lq C L is a scattered interval 
ofL, then FC(L ) < u n + 1. 

Theorem 3.4 The FC-rank of a context-free language is stricly less than uj w . 

Proof. Without loss of generality suppose that L C 2* is a prefix context-free 
language. There exists a grammar G in weak GNF of height n generating L. 

When L is scattered, then the claim holds by [H]. So suppose that L is quasi- 
dense. Then L can be represented as a dense sum of nonempty scattered linear 
orderings. This means that L can be partitioned into nonempty scattered 
intervals Lj indexed by the elements i of a dense countable linear ordering 
(D, <) such that if i < j in D then u < s v holds for all u E Li and w G Lj. By 
the previous Corollary and clause (3) of Proposition 12. 1[ FC(Lj) < uj n + 1 for 
each i Thus, FC(L) < u n + 1. □ 

4 Conclusion and further research 

It has been known that the FC-rank of a regular linear ordering, even automatic 
linear ordering is finite, cf. [16] . In this note we established the result that the 
FC-rank of a context-free linear ordering is less than Our proof method 
relying on the corresponding result for scattered context-free linear orderings 
in [13] was the expected one with key ingredient being that the FC-rank of 
every closed scattered interval in the lexicographic ordering of a context-free 
prefix language L is bounded by u n , where n is a constant depending only on 
the grammar (in weak GNF) generating L. 

In [7], it is shown that scattered linear orderings generated by certain higher- 
order deterministic grammars (alias leaf orderings of trees definable by higher- 
order schemes) all have FC-rank strictly less than eo, the least ordinal a sat- 
isfying the equality a = u a . We expect that our methods carry over to prove 
upper-bounds on the FC-rank of linears orders defined by these higher-order 
deterministic grammars. 

Acknowledgements The authors would like to thank Alexander Kartzow 
for pointing a mistake in the proof of Lemma 13. 21 
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